4 



* 



104-1 



\ 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: O'Donnell, Michael 
(ii) TITLE OF INVENTION: DNA POLYMERASE III HOLOE^fZYME 
(iii) NUMBER OF SEQUENCES: 60 

(iv) CORRESPONDENCE ADDRE 

(A) ADDRESSEE^ Nixe-n, HargixaVe/ Devan£/ & Doyle LLP 

(B) STREET : /Clint on Square, ]{p/0. Box / /L051 

(C) CITY: Rochester 

( D ) STATE : / New/ York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 14603 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-D0S/MS-DOS 

(D) SOFTWARE^ Pate"h<In . Re/ease #1.0, Version #1.30 



(vi) CURRENT APPLIC 

(A) . ' APPLICATION 

(B) FILING DATE: 2-8-^ 

(C) CLASSIFICATION: 



US 08/828,323 
L-1997 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NA^IE : Goldman, Michael /L. 

(B) REGISTRATION NUMBER: 3'0,727 

(C) REFERENCE/ DOCKET NUMBER: 19603/10213 



(ix) TELECOMMUN^CAT^ICdJ_INFORMATlON : 

(A) TELEPHONE.: ( 716 )' 263 -13 04 

(B) TELEFAX: (716) 263-1600 



(2) INFORMA1 



FOR SEQ ID NO:l 



(i)/SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Leu Arg Leu Tyr Pro Glu Gin Leu Arg Ala Gin Leu Asn Glu Gly 
15 10 15 

Leu Arg Ala Ala Tyr Leu Leu Leu Gly Asn Asp Pro 
20 25 




(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ala Ala Tyr Leu Leu Leu Gly Asn Asp Pro Leu Leu Leu Gin Glu Ser 
15 10 15 

Gin Asp Ala Val Arg 
20 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



Ala Gin Glu Asn Ala Ala Trp Phe Thr Ala Leu Ala Asn Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Val Glu Gin Ala Val Asn Asp Ala Ala His Phe Thr Pro Phe His Trp 
15 10 15 

Val Asp Ala Leu Leu Met Gly Lys 
20 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTACAACCGA ATCATATGTT ACCCAGCGAG CTC 33 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) 


MOLECULE 


TYPE : DNA 


(genomic) 






(xi) 


SEQUENCE 


DESCRIPTION: SEQ ID NO : 6 : 






ATGATTCGGT 


TGTACCCGGA 


ACAACTCCGC 


GCGCAGCTCA 


ATGAAGGGCT 


GCGCGCGGCG 


60 


TATCTTTTAC 


TTGGTAACGA 


TCCTCTGTTA 


f nm^i /"T tv /""l TV TV TV 

TTGCAGGAAA 


GCCAGGACGC 


TGTTCGTCAG 


120 


GTAGCTGCGG 


CACAAGGATT 


CGAAGAACAC 


CACACTTTTT 


^■t^lTV f 1 11 f ir*^ TV ITT,^T,/"1 

CCATTGATCC 


CAACACTGAC 


180 


TGGAATGCGA 


TCTTTTCGTT 


ATGCCAGGCT 


ATGAGTCTGT 


TTGCCAGTCG 


ACAAACGCTA 


240 


TTGCTGTTGT 


T AC C AG AAAA 


CGGACCGAAT 


GCGGCGATCA 


tv m/*i tv /*i /~i tv tv /"i m 

ATGAGCAACT 


TCTCACACTC 


300 


ACCGGACTTC 


TGCATGACGA 


CCTGCTGTTG 


ATCGTCCGCG 


rn tv tv rri tv tv tv rprp 

GTAAIAAAI 1 


AAGCAAAGCG 


360 


CAAGAAAATG 


CCGCCTGGTT 


TACTGCGCTT 


GCGAATCGCA 


GCGTGCAGGT 


GACCTGTCAG 


420 


ACACCGGAGC 


AGGCTCAGCT 


TCCCCGCTGG 


GTTGCTGCGC 


GCGCAAAACA 


GCTCAACTTA 


480 


GAACTGGATG 


ACGCGGCAAA 


TCAGGTGCTC 


TGCTACTGTT 


ATGAAGGTAA 


CCTGCTGGCG 


540 


CTGGCTCAGG 


CACTGGAGCG 


TTTATCGCTG 


CTCTGGCCAG 


ACGGCAAATT 


GACATTACCG 


600 


CGCGTTGAAC 


AGGCGGTGAA 


TGATGCCGCG 


CATTTCACCC 


CTTTTCATTG 


GGTTGATGCT 


660 


TTGTTGATGG 


GAAAAAGTAA 


GCGCGCATTG 


CATATTCTTC 


AGCAACTGCG 


TCTGGAAGGC 


720 


AGCGAACCGG 


TTATTTTGTT 


GCGCACATTA 


CAACGTGAAC 


TGTTGTTACT 


GGTTAACCTG 


780 


AAACGCCAGT 


CTGCCCATAC 


GCCACTGCGT 


GCGTTGTTTG 


ATAAGCATCG 


GGTATGGCAG 


840 


AACCGCCGGG 


GCATGATGGG 


CGAGGCGTTA 


AATCGCTTAA 


GTCAGACGCA 


GTTACGTCAG 


900 


GCCGTGCAAC 


TCCTGACACG 


AACGGAACTC 


ACCCTCAAAC 


AAGATTACGG 


TCAGTCAGTG 


960 


TGGGCAGAGC 


TGGAAGGGTT 


ATCTCTTCTG 


TTGTGCCATA 


AACCCCTGGC 


GGACGTATTT 


1020 


ATCGACGGTT 


GA 










1032 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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( D ) TOPOLOGY : 1 ine ar 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



CCGAACAGCT GATTCGTAAG CTGCCAAGCA TCCGTGCTGC GGATATTCGT TCCGACGAAG 



60 



AACAGACGTC GACCACAACG GATACTCCGG CAACGCCTGC ACGCGTCTCC ACCACGCTGG 



120 



GTAACTG 



127 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
TATGAAATCT TTACAGGCTC TGTTTGGCGG CACCTTTGAT CCGGTGCACT ATGGTCATCT 60 
AAAACCCGTT GGAAGCGTGG CCGAAGTTTT GATTGGTCTG AC 102 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met lie Arg Leu Tyr Pro Glu Gin Leu Arg Ala Gin Leu Asn Glu Gly 
15 10 15 
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Leu Arg Ala Ala Tyr Leu Leu Leu Gly Asn Asp Pro Leu Leu Leu Gin 
20 25 30 

Glu Ser Gin Asp Ala Val Arg Gin Val Ala Ala Ala Gin Gly Phe Glu 
35 40 45 

Glu His His Thr Phe Ser lie Asp Pro Asn Thr Asp Trp Asn Ala lie 
50 55 60 

Phe Ser Leu Cys Gin Ala Met Ser Leu Phe Ala Ser Arg Gin Thr Leu 
65 70 75 80 

Leu Leu Leu Leu Pro Glu Asn Gly Pro Asn Ala Ala lie Asn Glu Gin 

85 90 95 

Leu Leu Thr Leu Thr Gly Leu Leu His Asp Asp Leu Leu Leu lie Val 
100 105 110 

Arg Gly Asn Lys Leu Ser Lys Ala Gin Glu Asn Ala Ala Trp Phe Thr 
115 120 125 

Ala Leu Ala Asn Arg Ser Val Gin Val Thr Cys Gin Thr Pro Glu Gin 
130 135 140 

Ala Gin Leu Pro Arg Trp Val Ala Ala Arg Ala Lys Gin Leu Asn Leu 
145 150 155 160 

Glu Leu Asp Asp Ala Ala Asn Gin Val Leu Cys Tyr Cys Tyr Glu Gly 

165 170 175 

Asn Leu Leu Asn Leu Ala Gin Ala Leu Glu Arg Leu Ser Leu Leu Trp 
180 185 190 

Pro Asp Gly Lys Leu Thr Leu Pro Arg Val Glu Gin Ala Val Asn Asp 
195 200 205 

Ala Ala His Phe Thr Pro Phe His Trp Val Asp Ala Leu Leu Met Gly 
210 215 220 

Lys Ser Lys Arg Ala Leu His lie Leu Gin Gin Leu Arg Leu Gly Gly 
225 230 235 240 

Ser Glu Pro Val lie Leu Leu Arg Thr Leu Gin Arg Glu Leu Leu Leu 

245 250 255 

Leu Val Asn Leu Lys Arg Gin Ser Ala His Thr Pro Leu Arg Ala Leu 
260 265 270 
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Phe Asp Lys His Arg Val Trp Gin Asn Arg Arg Gly Met Met Gly Glu 
275 280 285 

Ala Leu Asn Arg Leu Ser Gin Thr Gin Leu Arg Gin Ala Val Gin Leu 
290 295 300 

Leu Thr Arg Thr Glu Leu Thr Leu Lys Gin Asp Tyr Gly Gin Ser Val 
305 310 315 320 

Trp Ala Glu Leu Glu Gly Leu Ser Leu Leu Leu Cys His Lys Pro Leu 

325 330 335 



Ala Asp Val Phe lie Asp Gly 
340 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Arg Trp Tyr Pro Trp Leu Arg Pro Asp Phe Glu Lys Leu Val Ala 
15 10 15 

Ser Tyr Gin Ala Gly Arg Gly His His Ala Leu Leu lie Gin Ala Leu 
20 25 30 

Pro Gly Met Gly Asp Asp Ala Leu lie Tyr Ala Leu Ser Arg Tyr Leu 
35 40 45 

Leu Cys Gin Gin Pro Gin Gly His Lys Ser Cys Gly His Cys Arg Gly 
50 55 60 

Cys Gin Leu Met Gin Ala Gly Thr His Pro Asp Tyr Tyr Thr Leu Ala 
65 70 75 80 

Pro Glu Lys Gly Lys Asn Thr Leu Gly Val Asp Ala Val Arg Glu Val 

85 90 95 
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Thr Glu Lys Leu Asn Glu His Ala Arg Leu Gly Gly Ala Lys Val Val 
100 105 110 

Trp Val Thr Asp Ala Ala Leu Leu Thr Asp Ala Ala Ala Asn Ala Leu 
115 120 125 

Leu Lys Thr Leu Glu Glu Pro Pro Ala Glu Thr Trp Phe Phe Leu Ala 
130 135 140 

Thr Arg Glu Pro Glu Arg Leu Leu Ala Thr Leu Arg Ser Arg Cys Arg 
145 150 155 160 

Leu His Tyr Leu Ala Pro Pro Pro Glu Gin Tyr Ala Val Thr Trp Leu 

165 170 175 

Ser Arg Glu Val Thr Met Ser Gin Asp Ala Leu Leu Ala Ala Leu Arg 
180 185 190 

Leu Ser Ala Gly Ser Pro Gly Ala Ala Leu Ala Leu Phe Gin Gly Asp 
195 200 205 

Asn Trp Gin Ala Arg Glu Thr Leu Cys Gin Ala Leu Ala Tyr Ser Val 
210 215 220 

Pro Ser Gly Asp Trp Tyr Ser Leu Leu Ala Ala Leu Asn His Glu Gin 
225 230 235 240 

Ala Pro Ala Arg Leu His Trp Leu Ala Thr Leu Leu Met Asp Ala Leu 

245 250 255 

Lys Arg His His Gly Ala Ala Gin Val Thr Asn Val Asp Val Pro Gly 
260 265 270 

Leu Val Ala Glu Leu Ala Asn His Leu Ser Pro Ser Arg Leu Gin Ala 
275 280 285 

lie Leu Gly Asp Val Cys His lie Arg Glu Gin Leu Met Ser Val Thr 
290 295 300 

Gly lie Asn Arg Glu Leu Leu lie Thr Asp Leu Leu Leu Arg lie Glu 
305 310 315 320 

His Tyr Leu Gin Pro Gly Val Val Leu Pro Val Pro His Leu 

325 330 



111619 




104-9 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTCTGGAAG AACCGCCGGC TGAAACTTGG TTTTTTCTGG CTACTCGTGA ACCGGAA 57 



(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCTGGTTCTC CGGGTGCTGC TCTGGCTCTG TTTCAGGGTG ATGACTGGCA GGCT 54 



(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

ATGAGATGGT ATCCATGGTT ACGACCTGAT TTCGAAAAAC TGGTAGCCAG CTATCAGGCC 6 0 

GGAAGAGGTC ACCATGCGCT ACTCATTCAG GCGTTACCGG GCATGGGCGA TGATGCTTTA 12 0 
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ATCTACGCCC 


TGAGCCGTTA 


TTTACTCTGC 


CAACAACCGC 


AGGGCCACAA 


AAGTTGCGGT 


180 


CACTGTCGTG 


GATGTCAGTT 


GATGCAGGCT 


GGCACGCATC 


CCGATTACTA 


CACCCTGGCT 


240 


CCCGAAAAAG 


GAAAAAATAC 


GCTGGGCGTT 


GATGCGGTAC 


GTGAGGTCAC 


CGAAAAGCTG 


300 


AATGAGCACG 


CACGCTTAGG 


TGGTGCGAAA 


GTCGTTTGGG 


TAACCGATGC 


TGCCTTACTA 


360 


ACCGACGCCG 


CGGCTAACGC 


ATTGCTGAAA 


ACGCTTGAAG 


AGCCACCAGC 


AGAAACTTGG 


420 


TTTTTCCTGG 


CTACCCGCGA 


GCCTGAACGT 


TTACTGGCAA 


CATTACGTAG 


TCGTTGTCGG 


480 


TTACATTACC 


TTGCGCCGCC 


GCCGGAACAG 


TACGCCGTGA 


CCTGGCTTTC 


ACGCGAAGTG 


540 


ACAATGTCAC 


AGGATGCATT 


ACTTGCCGCA 


TTGCGCTTAA 


GCGCCGGTTC 


GCCTGGCGCG 


600 


GCACTGGCGT 


TGTTTCAGGG 


AGATAACTGG 


CAGGCTCGTG 


AAACATTGTG 


TCAGGCGTTG 


660 


GCATATAGCG 


TGCCATCGGG 


CGATTGGTAT 


TCGCTGCTAG 


CGGCCCTTAA 


TCATGAACAA 


720 


GTCCCGGCGC 


GTTTACACTG 


GCTGGCAACG 


TTGCTGATGG 


ATGCGCTAAA 


ACGCCATCAT 


780 


GGTGCTGCGC 


AGGTGACCAA 


TGTTGATGTG 


CCGGGCCTGG 


TCGCCGAACT 


GGCAAACCAT 


840 


CTTTCTCCCT 


CGCGCCTGCA 


GGCTATACTG 


GGGGATGTTT 


GCCACATTCG 


TGAACAGTTA 


900 


ATGTCTGTTA 


CAGGCATCAA 


CCGCGAGCTT 


CTCATCACCG 


ATCTTTTACT 


GCGTATTGAG 


960 


CATTACCTGC 


AACCGGGCGT 


TGTGCTACCG 


GTTCCTCATC 


TT 




1002 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AAGAATCTTT CGATTTCTTT AATCGCACCC GCGCCCGCTA TCTGGAACTG GCAGCACAAG 60 

ATAAAAGCAT TCATACCATT GATGCCACCC AGCCGCTGGA GGCCGTGATG GATGCAATCC 12 0 

GCACTACCGT GACCCACTGG GTGAAGGAGT TGGACGC 157 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTAGAGAGAC ATCATGTTTT TAGTGGACTC ACACTGCCAT CTCGATGGTC TGGATTATGA 60 

ATCTTTGCAT AAGGACGTGG ATGACGTTCT GGCGAAAGCC GCCGCACGCG ATGTGAAATT 12 0 

TTGTCTGGCA GTCGCCACAA CAT 143 
(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Arg Trp Tyr Pro Pro Leu Arg Pro Asp Phe Glu Lys Leu Val Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Val Thr Glu Lys Leu Asn Glu His Ala Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Val Val Trp Val Thr Asp Ala Ala Leu Leu Thr Asp Ala Ala Ala Asn 
15 10 15 

Ala Leu Leu Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Leu Glu Glu Pro Pro Ala Glu Thr Trp Phe Phe Leu Ala Thr Arg 
1 5 10 15 

Glu Pro Glu Arg Leu Leu Ala Thr Leu 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Leu His Tyr Leu Ala Pro Pro Pro Glu Gin Tyr Ala Val Thr Trp Leu 
15 10 15 

Ser Arg 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Leu Ser Ala Gly Ser Pro Gly Ala Ala Leu Ala Leu Phe Gin Gly Asp 
15 10 15 

Asn Trp Gin Ala Arg 
20 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

Leu Gly Gly Ala Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Ala Cys Thr Cys Thr Gly Gly Ala Ala Gly Ala Ala Cys Cys Gly Cys 
1 5 10 15 

Cys Gly Gly Cys Thr Thr Gly Ala Ala Ala Cys Thr Thr Gly Gly Thr 
20 25 30 

Thr Thr Thr Thr Thr Cys Thr Gly Gly Cys Thr Ala Cys Thr Cys Gly 
35 40 45 

Thr Gly Ala Ala Cys Cys Gly Gly Ala Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Gly Cys Thr Gly Gly Thr Thr Cys Thr Cys Cys Gly Gly Gly Thr Gly 
15 10 15 

Cys Thr Gly Cys Thr Cys Thr Gly Gly Cys Thr Cys Thr Gly Thr Thr 
20 25 30 

Thr Cys Ala Gly Gly Gly Thr Gly Ala Thr Ala Ala Cys Thr Gly Gly 
35 40 45 

Cys Ala Gly Gly Cys Thr 
50 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gly Gly Thr Gly Ala Ala Gly Gly Ala Gly Thr Thr Gly Gly Ala Cys 
15 10 15 

Ala Thr Ala Thr Gly Ala Gly Ala Thr Gly Gly Thr Ala Thr Cys Cys 
20 25 30 

Ala 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Leu Lys Asn Leu Ala Lys Leu Asp Gin Thr Glu Met Asp Lys Val 
15 10 15 

Asn Val Asp Leu Ala Ala Ala Gly Val Ala Phe Lys Glu Arg Tyr Asn 
20 25 30 

Met Pro Val He Ala Glu Ala Val 
35 40 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATGCTGAAAA ACCTGGCTAA ACTGGATCAG ACTGAAATGG ATAAAGTTAA CGTTGAT 57 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CTGGCTGCTG CTGGTGTTGC TTTTAAGGAA CGTTATAACA TGCCGGTTAT TGCTGAA 57 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGCTGAAGA ATCTGGCTAA ACTGGATCAA ACAGAAATGG ATAAAGTGAA TGTCGATTTG 60 

GCGGCGGCCG GGGTGGCATT TAAAGAACGC TACAATATGC CGGTGATCGC TGAAGCGGTT 120 

GAACGTGAAC AGCCTGAACA TTTGCGCAGC TGGTTTCGCG AGCGGCTTAT TGCCCACCGT 180 

TTGGCTTCGG TCAATCTGTC ACGTTTACCT TACGAGCCCA AACTTAAA 228 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

AGGCGTAGCG AAGGGAGCGT GCAGTTGAAG CCATATTATC TATTCCTTTT TGTAATAACT 6 0 

TTTTTACAGA CGATAACCTT GTCTAATGTC TGAGTCGAGG ATCATCAATT CCGGCTTGCC 120 

ATCCTGGCTC ACTCTTAGTA ACTTTTGCCC GCGAATGATG AGGAGATTAA GA 172 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



TAAAACTTAT ACAGAGTTAC ACTTTCTTAC ATAACGCCTG CTAAATTATG AGTATTTTCT 



60 



AAACCGCACT CATAATTTGC AGTCATTTTG AAAAGGAAGT CATTATG 



107 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Leu Lys Asn Leu Ala Lys Leu Asp Gin Thr Glu Met Asp Lys Val 

15 10 15 

Asn Val Asp Leu Ala Ala Ala Gly Val Ala Phe Lys Glu Arg Tyr Asn 
20 25 30 

Met Pro Val lie Ala Glu Ala Val Glu Arg Glu Gin Pro Glu His Leu 



Arg Ser Trp Phe Arg Glu Arg Leu lie Ala His Arg Leu Ala Ser Val 



35 



40 



45 



50 



55 



60 



Asn 
65 



Leu Ser Arg Leu Pro Tyr Glu Pro Lys Leu Lys 

70 75 



(2) 



INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Leu Lys Asn Leu Ala Lys Leu Asp Gin Thr Glu Met Asp Lys Val 
15 10 15 

Asn Val Asp Leu Ala Ala Ala Gly Val Ala Phe Lys Glu Ala Tyr Asn 
20 25 30 

Met Pro Val lie Ala Glu Ala Val 
35 40 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
ATGCTGAAAA ACCTGGCTAA ACTGGATCAG ACTGAAATGG ATAAAGTTAA CGTTGAT 57 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



CTGGCTGCTG CTGGTGTTGC TTTTAAAGAA CGTTATAACA TGCCGGTTAT TGCTGAA 



57 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATGATGAGGA GATTACATAT GCTGAAGAAT CTG 33 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GAGGAATTCG GCTTTTTTGC CGAATTCCTC GGCCCCTAGG AGATCTCAGC T 51 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Thr Ser Arg Arg Asp Trp Gin Leu Gin Gin Leu Gly He Thr Gin 
15 10 15 

Trp Ser Leu Arg Arg Pro Gly Ala Leu Gin Gly Glu He Ala He Ala 
20 25 30 

He Pro Ala His Val Arg Leu Val Met Val Ala Asn Asp Leu Pro Ala 
35 40 45 

Leu Thr Asp Pro Leu Val Ser Asp Val Leu Arg Ala Leu Thr Val Ser 
50 55 60 

Pro Asp Gin Val Leu Gin Leu Thr Pro Glu Lys He Ala Met Leu Pro 
65 70 75 80 

Gin Gly Ser His Cys Asn Ser Trp Arg Leu Gly Thr Asp Glu Pro Leu 

85 90 95 

Ser Leu Glu Gly Ala Gin Val Ala Ser Pro Ala Leu Thr Asp Leu Arg 
100 105 110 

Ala Asn Pro Thr Ala Arg Ala Ala Leu Trp Gin Gin He Cys Thr Tyr 
115 120 125 

Glu His Asp Phe Phe Pro Gly Asn Asp 
130 135 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
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ATGACATCCC 


GACGAGACTG 


GCAGTTACAG 


CAACTGGGCA 


TTACCCAGTG 


GTCGCTGCGT 


60 


CGCCCTGGCG 


CGTTGCAGGG 


CGAGATTGCC 


ATTGCGATCC 


CGGCACACGT 


CCGTCTGGTG 


120 


ATGGTGGCAA 


ACGATCTTCC 


CGCCCTGACT 


GATCCTTTAG 


TGAGCGATGT 


TCTGCGCGCA 


180 


TTAACCGTCA 


GCCCCGACCA 


GGTGCTGCAA 


CTGACGCCAG 


AAAAAATCGC 


GATGCTGCCG 


240 


CAAGGCAGTC 


ACTGCAACAG 


TTGGCGGTTG 


GGTACTGACG 


AACCGCTATC 


ACTGGAAGGC 


300 


GCTCAGGTGG 


CATCACCGGC 


GCTCACCGAT 


TTACGGGCAA 


ACCCAACGGC 


ACGCGCCGCG 


360 


TTATGGCAAC 


AAATTTGCAC 


ATATGAACAC 


GATTTCTTCC 


CTGGAAACGA 


C 


411 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GGCGATTATA GCCATATGTT GGCGCGGTAT CGACGAATTT GCTATATTTG CGCCCCTGAC 60 
AACAGGAGCG ATTCGCT 77 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 



TGATTTACCG GCAGCTTACC ACATTGAACA ACGCGCCCAC GCCTTTCCGT GGAGTGAAAA 



60 



AACGTTTGCC AGCAACCAGG GCGAGCGTTA TCTCAACTTT CAG 



103 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GATTCCATAT GACATCCCGA CGAGACT 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GACTGGATCC CTGCAGGCCG GTGAATGAGT 3 0 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 



Leu Gly Thr Asp Glu Pro Leu Ser Leu Glu Glu Ala Gin Val Ala Ser 
15 10 15 

Pro 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Ala Ala Leu Trp Gin Gin lie Cys Thr Tyr Glu His Asp Phe Phe Pro 
15 10 15 

Ala 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CAACAGGAGC GATTCCATAT GACATCCCGA CG 32 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
GATTCGGATC CCTGCAGGCC GGTGAATGAG T 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 
CCCCACATAT GAAAAACGCG ACGTTCTACC 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ACCCGGATCC AAACTGCCGG TGACATTC 



28 



(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

ATGAAAAACG CGACGTTCTA CCTTCTGGAC AATGACACCA CCGTCGATGG CTTAAGCGCC 60 

GTTGAGCACC TGGTGTGTGA AATTGCCGCA GAACGTTGGC GCAGCGGTAA GCGCGTGCTC 120 

ATCGCCTGTG AAGATGAAAA GCAGGCTTAC GCCCTGGATG AAGCCCTGTG GGCGCGTCCG 180 

GCAGAAAGCT TTGTTCCGCA TAATTTAGCG GGAGAAGGAC CGCGCGGCGG TGTACCGGTG 240 

GAGATCGCCT GGCCGCAAAA GCGTAGCAGC AGCCGGCGCG ATATATTGAT TAGTCTGCGA 3 00 

ACAAGCTTTG CAGATTTTGC CACCGCTTTT ACAGAAGTGG TAGACTTCGT TCCTCATGAA 360 

GATTCTCTGA AACAACTGGC GCGCGAACGC TATAAAGCCT ACCGCGTGGC TGGTTTCAAC 42 0 

CTGAATACGG CAACTTGGAA A 441 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



TAACGGCGAA GAGTAATTGC GTCAGGCAAG GCTGTTATTG CCGGATGCGG CGTGAACGCC 



60 



TTATCCGACC TACACAGCAC TGAACTCGTA GGCCTGATAA GACACAACAG CGTCGCATCA 



120 



GGCGCTGCGG TGTATACCTG ATGCGTATTT AAATCCACCA CAAGAAGCCC CATTT 



175 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TAATGGAAAA GACATATAAC CCACAAGATA TCGAACAGCC GCTTTACGAG CACTGGGAAA 6 0 

AAAGCCAGGA AAGTTTCTGC ATCATGATCC CGCCGCCGAA 100 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Lys Asp Ala Thr Phe Tyr Leu Leu Asp Asn Asp Thr Thr Val Asp 
15 10 15 

Gly Leu Ser Ala Val Glu Gin Leu Val Cys Glu lie Ala Ala Glu Arg 
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Trp Arg Ser 
35 

Ala Tyr Arg 
50 



Gly Lys 
Leu Asp 



Val Pro His Asn Leu 
65 



Glu He Ala 



He Ser Leu 



Val Val Asp 
115 

Glu Arg Tyr 
130 

Thr Trp Lys 
145 



Trp Pro 
85 

Arg Thr 
100 

Phe Val 
Lys Ala 



Arg Val Leu 
40 

Glu Ala Leu 
55 

Ala Gly Glu 
70 

Gin Lys Arg 



Ser Phe Ala 



Pro Tyr Glu 
120 

Tyr Arg Val 
135 



He Ala Cys 

Trp Ala Arg 

Gly Pro Arg 
75 

Ser Ser Ser 
90 

Asp Phe Ala 
105 

Asp Ser Leu 
Ala Gly Phe 



Glu Asp 
45 

Pro Ala 
60 

Gly Gly 
Arg Arg 
Thr Ala 



Lys Gin 
125 

Asn Leu 
140 



Glu Lys Gin 

Glu Ser Phe 

Ala Pro Val 
80 

Asp He Leu 
95 

Phe Thr Glu 
110 

Leu Ala Arg 
Asn Thr Ala 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

Met Lys Asn Ala Thr Phe Tyr Leu Leu Asp Asn Asp Thr Thr Val Asp 
15 10 15 

Gly Leu Ser Ala Val Glu Gin Leu Val Xaa Glu He Ala 
20 25 
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(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Val Leu He Ala Xaa Glu Asp Glu Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

Leu Asp Glu Ala Leu Trp Ala Ala Pro Ala Glu Ser Phe Val Pro His 
15 10 15 

Asn Leu Ala Gly Glu 
20 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Gly Gly Ala Pro Val Glu lie Ala Trp Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

Gly Phe Asn Leu Asn Thr Ala Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
CCCCACATAT GAAAAACGCG ACGTTCTACC 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ACCCGGATCC AAACTGCCGG TGACGTTC 2 8 
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